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ABSTRACT We isolated overlapping cDNA clones corre- 
sponding to the migor MET protooncogene transcript. The 
cDNA nucleotide sequence contained an open reading frame of 
1408 amino adds with features characteristic of the tyrosine 
kinase family of growth factor receptors. Tliese features 
include a putative 24-amino acid signal peptide and a candi- 
date, hybrophobic, membrane-siianning segm«it of 23 amino 
adds, which defines an extracellular domain 1^926 amino adds 
that coidd serve as a ligand-binding domain. A putative 
intracellular domain 435 amino acids long shows high homol- 
ogy with the SRC family of tyrosine kinases and within the 
kinase domain is most homologous with the human insulin 
receptor (44%) and v-abl (41%). Despite these similarities, 
however, we found no apparent sequence homology to other 
growth factor receptors in the putative ligand-binding domain. 
We conclude from these results that the MET protooncogene is 
a cell-surface receptor for an as-yet-unknown ligand. 



The MET oncogene was isolated from a human osteogenic 
sarcoma cell line (HOS) treated in vitro with the chemical 
carcinogen A^-methyl-N'-nitro-N-nitrosoguanidine (MNNG) 
(1, 2). Activation of MET occurred by way of a DNA 
rearrangement in MNNG-treated HOS cells (MNNG-HOS 
cells) that fused sequences from the TPR (translocated 
promoter region) locus on chromosome 1 to sequences from 
the AfETIocus on chromosome 7 (3, 4). Nucleotide sequence 
analysis of a portion of the MET oncogene (5) showed that it . 
is a member of the tyrosine kinase family of oncogenes (6) and 
in the kinase domain is most homologous to the human insulin 
receptor (HIR) (7) and the murine v-abl oncogene (8). 

The fused TPR-MET oncogene expresses a S.O-kilobase 
(kb) transcript (3) that encodes a 65-kDa protein that is 
phosphorylated in vivo on tyrosine and serine and in vitro 
only on tyrosine (9) (M. Gonzatti-Haces, personal commu- 
nication). The activation of the MET oncogene by fusion of 
sequences from chromosome 1 to chromosome 7 resembles 
the mechanism of activation for several members of the 
tyrosine kinase family of oncogenes. This includes the 
rearranged human BCR-ABL gene on the Philadelphia chro- 
mosome translocation found in chronic myeloid leukemia 
(10) and the TRK oncogene isolated from a human colon 
carcinoma (11). The TPR locus may provide coding se- 
quences to the MET oncogene locus or alternatively MET 
activation may be analogous to the s-erbB transforming gene 
of avian myeloblastosis vims that encodes a truncated 
portion of the epidermal growth factor receptor (EGFR) with 
an altered kinase activity (12). 

The MET protooncogene is expressed predominantly as a 
9.0-kb RNA species in human fibroblast and epithelial cell 
lines (3). However, both HOS and MNNG-HOS cell lines 
express additional M£7-related RNAs of 6.0 kb and of 7.0 kb. 



The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked advertisement^ 
in accordance with 18 U.S.C. §1734 solely to indicate this fact. 



Three MET-related proteins of 110, 140, and 160 kDa can be 
immunoprecipitated with a MET C-terminal anti-peptide 
antibody from human cell lines expressing the 9.0-kb MET 
RNA (9) (M. Gonzatti-Haces, personal conununication). The 
140-kpa protein species has tyrosine Icinase-specific activity 
in vitro and is phosphorylated on serine and threonine in vivo. 
From these analyses we concluded that MET may be similar 
in function to either membrane growth factor receptors or to 
nonintegral membrane proteins such as the abl and SRC 
members of the tyrosine kinase gene family (6). 

In this paper we describe the preparation and nucleotide 
sequence of cDNA clones covering 7.0 kb of the mRNA 
expressed by the MET protooncogene locus. § Using these 
cDNAs as probes in Southern blot analyses, we estimate that 
the MET genomic locus is at least 100 kb long. The cDNA 
sequence has a single open reading frame that translates into a 
protein 1408 amino acids long possessing a structure charac- 
teristic of members of the tyrosine kinase growth factor receptor 
family. However, sequence homology to other growth factor 
receptor proteins is restricted to the kinase domain. 

MATERIALS AND METHODS 

Preparation of cDN A Libraries. Two cDNA libraries were 
constructed with poly(A) RNA isolated from the HOS cell 
line. An S-;ig sample was treated with 1 mM methylmercu- 
ry(II) for 10 min at room temperature and neutralized with 10 
mM 2-mercaptoethanol. The cDNA was made according to 
the method of Gubler and Hoffman (13). Excess linkers 
(EcoRI) were removed when cDNA was size-selected on a 
Bio-Gel A-50m column. From a total of ==200 ng of cDNA 
obtained, 40 ng was cloned in the XgtlO vector to yield »»10^ 
recombinant plaques for each library. 

Isolation of RNA and Blot Hybridization. Total cellular 
RNA was prepared using the guanidinium/cesium chloride 
method (3). Poly(A)^ RNA (5 /ng) was denatured with 
formamide, fractionated by electrophoresis on 1.2% formal- 
dehyde/agarose gels, transferred to nitrocellulose, and hy- 
bridized with 2 X 10^ cpm of probe per ml (3). 

Nucleotide Sequencing. DNA sequences were determined 
by the dideoxynucleotide-termination method (14, 15) after 
subcloning the restriction endonuclease fragments into 
M13mpl8 and mpl9 phage vectors. 

RESULTS 

Isohition and Characterization otMET Protooncogene cDN A 
Clones. To isolate the 5' portion for the MET protooncogene 
9.0-kb RNA, we constructed two cDN A libraries from HOS 
cell poly(A) RNA. One library was constructed from 



Abbreviations: HOS, human osteogenic sarcoma; MNNG, N- 

methyl-A^'-nitro-N-nitrosoguamdine; HIR. human insulin receptor; 

EGFR, epidermal growth factor receptor. 

fiThe sequence reported in this paper is being deposited in the 
EMBL/GenBank data base (Bolt, Beranek, and Newman Labora- 
tories, Cambridge, MA, and Eur. Mol. Biol. Lab., Heidelberg) 
(accession no. J02958). 
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oligo(dT)-primed cDNA enriched in double-stranded cDNA 
molecules >1.5 kb. The second library was primed with a 
30-mer oligonucleotide (see Fig. 2) corresponding to MET 
open reading frame in fragment D (Fig. L4) (3). Both libraries 
were constructed using the XgtlO vector system. A third 
cDNA library prepared from human A431 cell RNA was also 
screened. Thus, all cDNAs isolated represent portions of the 
MET protooncogene transcripts of 6, 7, or 9 kb in HOS cells 
or 9 kb in A431 cells and not the TP/^-AfET oncogene RNA 
of 5.0 kb expressed in MNNG-HOS cells (3). 

Initially 200,000 phage from the olig^dT)-primed cDNA 
library from HOS cell RNA were screened with a MET 
3'-region probe (fragment G, Fig. LA). Seventeen positive 
plaques were identified, and all phage contained a »1.8-kb 
EcoRi restriction fragment corresponding to the 3' end of the 
MET cDNA (from an internal EcoRl restriction site to the 
linker). The largest cDNA clone (Xmetll) contained an addi> 
tional 2.1-kb Eco¥l restriction fragment that hybridized with 
probes further upstream in the A/CT oncogene locus. A cDNA 
library prepared from human A431 cells was screened with 
probe G, and 15 positive clones were identified. The largest of 
these, pmetlO, contains a 1.6-kb insert that overlaps with 
Mnetll. 

The unamplified oligonudeotide-primed HOS cDNA li- 
brary was screened using probe D, which contains the primer 
sequence, and the upstream probe fragment C (Fig. L4). A 
total of 30 positive plaques were identified, and two inde- 
pendent recombinants (XmetS and -14, Fig. lA) containing the 
largest cDNA inserts («>4.0 kb) were further characterized. 
We found that XmetS and -14 were indistinguishable by 
restriction endonuclease analyses [e.g., XmetS and -14 were 
cleaved by EcoRI into three fragments of 2.3, 1.3, and 0.3 kb 
(Fig. U)]. The 1.3-kb fragment of XmetS and -14 hybridized 
with fragments D and C of the rP/t-AfET oncogene locus (3) 
and corresponds to the most 3' portion in XmetS and -14. AH 
further analyses were performed with the XmetS insert. 

To determine the relationship of the MET cDN A sequences 
to the characterized MET RNA transcripts (3), we prepared 
probes from the 2.3-kb and 1.3-kb subfragments of XmetS, the 
1.6-kb fragment of pmetlO, and the 2.2-kb and 1.8-kb fragments 
of Xmetll. These were hybridized to RNA prepared from 
MNNG-HOS. HOS, and CALU-1 ceUs (Fig. 1 B and Q. The 
1.3-kb fragment of XmetS (Fig. 14) plus the pmetlO and Xmetll 
fragment probes all hybridized with the three MET RNA 
species (9.0. 7.0, and 6.0 kb) in CALU-1, HOS, and MNNG- 
HOS ceUs and to the S.O-kb TPRr44ET oncogene RNA in 
MNNG-HOS ceUs (Fig. IS). However, a SOO-base-pair probe 
derived from a 5' subfragment of the 2.3-kb EcoRI fragment of 
XmetS hybridized only to the 9.0-kb MET protooncogene 
transcript (Fig. IQ. 



The restriction map of the overlapping cDNA clones as 
represented in the 9.0-kb MET protooncogene RNA is 
depicted in Fig. 1 A . We have not excluded the possibility that 
the 7.0 kb of overiapping cDNA does not fully represent the 
9.0-kb MET transcript. However, in heteroduplex analysis 
the size and complexity of the human cDN A clones compare 
favorably to a 6.8-kb mouse met cDN A (A. Iyer, unpublished 
results). Furthermore, we have isolated overlapping cDNA 
from several independent libraries and find no evidence from 
restriction analysis for heterogeneity. The 9.0-kb RNA size is 
most likely an overestimate from gel electrophoretic mobility 
measurement of the size of the actual RNA transcript (3). 

Nucleotide Sequence otMET cDNA and the Deduced Protein 
Sequence. The sequence of the overlapping cDNA clones 
corresponding to the MET protooncogene 9.0-kb transcript 
reveals a single open reading frame of 4224 nucleotides (Fig. 
2). The first ATG codon found in this open reading frame at 
nucleotides 1-3 and its context match the consensus se- 
quence for a translation initiation site (16), We assume that 
this is the initiation site for translation, since additional ATGs 
preceding this codon are immediately followed by termina- 
tion codons in all three reading frames. We also find multiple 
termination codons downstream of the open reading frame 
that ends at nucleotide 422S. The 3' 1.8-kb EcoRl restriction 
fragment of the Xmetll clone (Fig. L4) contains the 3'- 
noncoding sequence, poly(A) adenylylation signal, and a 
portion of the poly(A)^ tract (data not shown). 

The predicted primary structure of the MET protoonco- 
gene protein product is similar to other cell-surface growth 
factor receptors (Fig. 2). The first 24 N-terminal amino acids 
with the exception of the alanine (residue 21) are hydro- 
phobic and could serve as a signal sequence for transporting 
the protein into the lumen of the endoplasmic reticulum. By 
analogy to the EGFR or HIR this sequence may be cleaved 
after glycine-24 in MET since signal peptidases frequently 
cleave after this amino acid residue (17). Hydropathy analysis 
also reveals a stretch of 23 amino acids (residues 950-973) 
that are strongly hydrophobic and could serve as a trans- 
membrane domain for the putative receptor (Fig. 2). This 
sequence like other transmembrane domains is followed 
immediately by a cluster of hydrophilic amino acids (Lys- 
Lys-Arg-Lys. residues 974-977) that can serve as a cytoplas- 
mic anchor. This delineates residues 24-9S0 as a putative 
extracellular domain that is analogous to the ligand-binding 
domain of other receptors. 

Residues 973-1408 constitute the cytoplasmic intracellular 
domain and contains the phosphotransferase effector moiety. 
The sequence of this region contains a consensus ATP- 
binding domain and a kinase domain (residues 1101-13S1) 
that is highly homologous with the SRC family of tyrosine 
kinases (6) where the closest match is with the HIR (44%) (7), 
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Fig. 1. (A) Schematic diagram showing Uie TPR-MET oncogene locus and the probes used for analysis of cDNA clones. This diagram is 
a composite of the three overlapping cDNA clones Xmetll (XHOSll), pmetlO. and Xmet5 (XHOSS). The £coRI restriction endonuclease sites 
(E) for each cDNA clone are marked. {B and Q RNA gel blot hybridization analysis of human and MET NIH 3T3-cell transformant mRNAs. 
Poly(A)-selectcd RNA (5 ftg) from HOS. MNNG-HOS. and MCTNIH 3T3 transformant 2212C (CALU-1) cells was hybridized with radiolabeled 
MET protooncogene cDNA fragments, a 1.3-kb EcoRl subfragment from Xmet5 (B) and a 500-base>pair EcoKl-Pst I subfragment of Xmet5 (O- 
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-I0« OAATTCCOCCCTCMCOCCCGC4MOCCCCCaMMOCtT10TCAOCAOATOOOOAIKOOAOtOO^^ .IM 

■IBO CAMTO0(KIO0aMeAQCtOACTTOCT0MA00M>OaKXMM0OC0C0OM»COaK» .1 

1 ATOAAOOCCCCCOCTOTO€TT(K:ACCtGGCATCCTCOTtK:TCCTOTTTACCTT0OT0CAOMGMCAAT0Q00Ad^^ 180 

1 M»tLy» AUrT^UV*ll^nAU?roaiyIl«t«ttV*lL«ttLtttJhtThrl.>ttVi.l01aArg8«r*iaaiyO l^ fcyi^ ^ «0 

i8l TAtCAOCTTCCCAACTTCACCOOWAAACACCCATCCAOAATOTCATTCTACATOAOCATCACATTTTCCTtOOXKCACTAACTACATTTAIGTTTTi^ Q«0 

41 Tyi01aL«orroAiB?h^ThrAl«01nTbrrroll*01iiA«nVklll«Lfl«nt«01aHltKlsXl«n 10 

841 QTWIOAOTACAAOACtOOOCClQtOCTOOAACACCCAOAlffSfttCCCWlOTtAOOAC^^ Seo 

01 V*lAlKti»lyrltf«thr01y>roVfclt»tt01oHi»yroA«rierdrh4>rijey4>l«U»»HCytB4r8«rl^iAl4AaiiL»a8«rO lao 



aei OTTOTCOACACCTACTATOATaATCAACTCATTAOCffSftoCAOOOtCAACAGAGGOACCffSSbAOCOACATOTCTTTCCCCACAAT^^ «aO 
181 VkiVaiAtpthrTyrTyrAspAspOlnLtttIl«8«f££tpiy8«rVftlA»nAr(GlyThiCyiblaAr(Httvairb«rroHlsA leo 



«ai ATAITCrCCCCACAOATAOAAOAOCCCAOCCAdTOTECTGAdRnpTGOTGAOCGCCC^^ eOO 

161 ll«fh*8«rrro01aXltOla01«rre8«raiii|C2a|rroAa|fe2^alv«i8irAUt«B01yAUI^ty«ll.««8«r8arVAlLycAapA^ 800 

601 ATAAATTCTTCTTATTTCCCAOATCArCCATTOCATT0OATATCACT0AOAAi(K»CTAAA0CAAACOJUUU>AT0CTTTTATOTTTTtOACOOACCAOTCCTACATTOATOTTTTATO 780 

801 IlaAinStr8«rtyryharroAipHt8rr0LitmtiSarll«8«rv«iArgArgLtulytOlaTbrLytAipOlyyht8a(rhaLa«TbrAapOln8«rTyrlt«ABpv»iL««rroOlo 840 

781 TTCAOAOATTCTTACCCCATTAAOTATOTCCAlOCCTTTOAAAOCAACAATTTTATTTACTTCTTOACGCTCCAAAOOOAAACTCTAaATOCrCAOACTTTTCACACAAOAAtAATCAOO 840 

841 nkaAr|A8p8arTyryroXlaLystyrvalHiaAUrhaoiv8arAaiiAanPhaIlaTyrrhaLa«ThrVaioinAr(OI«Tferi,a«AapAlaOI»TbrybaKlaTbrArfllatl«Arf 860 

04 1 TT(fi3%CCATAAACTCTGOATTOCATTCCTAC ATQOAAATOCCTCTOOAdTSIkTTCTC ACACAAAAGAOAAAAAAOAGATCCACAA^ 060 

asi rh«|C2;s^«rIl«AioS«rOlyL*vM)tS«rTyrHttOlttH«trroLtQ01^Cjrs|xi«L»«Thr01oZ,yiAr(LysLyBArg5« 980 

081 tATqTCAOCAAOCCttMOOCCCAOCTtOCTAOACAAATAOOAOCCAOCCTOAATOATOACATTCTTTTCOOGGTaTTCOCACAAAOCAAOCCAOATTCTOCCOAACC^ 1080 

S81 TyrV»ls«rl.yayroOtyAlaOliiL««AUAr(0lAlle01yAl«6«rL««ABnAipAapIl«L«ttPh«aiyVftirhsAlaGln8«rLy«rroAapS«rAlA01«troHatAapArg8«r 880 



1081 0CCAt(ffS1!tK:ATTCCCTATCAAATATOTCAACOACTTCTTCAACAA(MTCOTCAACAAAAACAAT0T0AaAff^ 1800 
sei Al»Htt |Cy«K l»yh«yroIl»LyiTyrV«IAinAipyh<yhtA8ftLy8ll«VtlA»nty«AinA»nV*lAr flCyii L»nQlnHlf >h»Tyr01yf roAanHtaOlgHt jcyay 400 



1801 ACACttCTOAOAAATTCATCAOOdRnbAAGCGCOCCOTOATOAATATCOAACAOAOTTTACCACAOCTTTCCAOCOCOTTOACTtATTCATOOOTCAATTCAOCOAA^ 1880 

401 Thrta«La»Arg A«nS*rg>rO l3i |Cyap iaAUArtArfAapOlaTyrArtThyQlt>PhaThrThrAULaBOlaArfV>lAapLattyhaMatOlyQln>ba8ar01«V*ltau^^ 440 

1381 TCTATATCCACCTTCATTAAAGOACACCTCACCATAOCTAATCTTt30OACATCAOAQ<X>TCOCTTCATOCA00TT0T00TTTCTC0ATCA00ACCATCAACCCCtCAXGt^ 1440 

441 88rtta8arThryhaxiatjiGlyAapLanThr2laAUAaaLaaoiyThrSaroi«01yArfybaH«toiAV«}vaivai8arArf8aiiilyfroStrThrProHiavalAaaPbaLas 480 

1 44 1 CnOACTCCCATCCAOTOTCtCCAOAAOTOATTOTGOAOCATACATTAAACCAAAATOOCTACACAGTOOTTATCACTOOOAAOAAGATC AOGA AOATC^ 1 880 

481 U«AipSarHl8PrpVai8arPrQ01«vaixiaVaioiaRiaThrtaaAaiiOUAsaGlyTyrThrLiavaixiaThroiyi.yaLyazi«nirLyallarroLaaAaB01ytatf 880 

1861 A0ACATTTCCA0TCC(reCtoTCAAffCefcTCTCTOCCCCACCCTT»TTCA(ffS%OCT^^ 1680 

981 Ar<Hia>haOln»af lcya8 aroii Cyafc att8arAl»Proyrcmv>ioii teyA lyTri fcyA laAipLy AyaV alArt8»roiBOi^^ 860 

1681 ffSTtTOCCTOCAATCT AC AAOOTTTTCCCAAAIAOTGC ACCCCTTGAAGGAOOOAC AAGOCItlACC AT«ff5fb^ TOOO ACTTTOOaTT TCOOAOOAATAAT AAATTTOATTT AAAOAAA 1 600 

861 jcy«^ ^oy^0A^*Il^Ty^Ly«v>l>h♦y^0A«ftS^^A^*y^0L^o0^qQ^yO^yTh^A^^t^oTh^Il ^Cy«p lyT^pA»pyh^GIy>h^A^gA^^A^^A»nLyIyl>^A^pL^aLy■Ly« 600 

1801 ACTAOA0TTC7CCTTGOAAATGAOA(KffG€KCCTTOACTtTAAGTCAaAGCACOATGAATACATTGAA#ff09ACAQTT0OTCCTGCCATO 1880 

601 TbrAr gv« 1 LavLavO l yAano i ttS • ifcy ag brLaaTbr lans a rO i aS • r TbrMa t ai nTbrW»^ wCytf brv* IG lyV roA 1 »ita i AanlyaH I afba AaaMatSar iiaxlaXla 840 

1081 TC AAATOOCCAOOOOACAACACAATACAOTACATTCTCCTATOTOOATCCTOTAATAACAAOt ATTTCOCCOAAATACtMTCCTATaOCTOOXOOCACrTt 8040 

641 8arAaB01yHtaOlyThrtbrolnTyr8*rTbrrb*8BrTyrVftiAipyroVftliiaTbr8«rIl«8«ryrotyttyi<»lyyroM*tAl«Oly01ytbrttuX^QTbrL^ 680 

8041 TACCTAAACAOtOOOAATTCTAOACACATTTCAATtaOTOOAAAAACMffSTKCTTTAAAAAOTOTOTCAAACAOTATTCTTOAW^^ 81 60 

681 Tyrta«A8a8aitilyAaKSarAr|Niaxia8ariiaOiyoiyl^aTbi{cz8pbrta«Xorasarvai8arA8a8arZlaX.a«01ig|c^ V80 

8101 GCTOTTAAATTOAAAATTOACTTAOCCAACCOAOAOACAACCATCTTCAGTTACCGTGAAOATCCCATTGTCTATOAAATTCATCCAACCAAATCTTTTATTAaTACTTOOTaOA 8880 

781 Al4ValLy8L«iiXy8Xl8A8pX.*aAlftA8nArgoiQTbrSarll«rbaS«rTyrArg01iiA«prroIl*ValTyrOliiX)«HlayroTbrLyBS8rPb»Zla8«rTbrTrpTrpLy801ii 760 

888 1 eCTCTC AAC ATTOf CAOtTTTCT ATTlK^TTOCCAOtOatOOOAOCAC AATAACAMTOTTQOOAAAAACCTGAATTCAOTTAGTOTCCCOAOAAt^ 8400 

761 ProlavAanllavalSarPbaLattPbiQrifbaAlaSarOlyOlySarTbrnaTbroiyvaioiyXjpaAsaLavAabSarvaiSarVairr^ 800 

8401 OOAAOGAACTTTACAGTOOC/^fSffcAACAtCOCtCTAATTCAGACATAATqrGTIRSTKCCACTCCTTCCCTGCAACAOCT^ 8980 

801 01yArtAanybaIbrv»IAl4^ y ipinHi»ArtSirA»nS»rolttlltl l 4cy»py«g hrThryros<rL»tt01irfilnL>TLA8nL»BOlril.«ayreL«ttLyBTbrl.yaAl*ybtrh»Ma» 840 

8881 TTAaATOOOATCCTttCCAAATACTTTOATCTCATTTATOTACATAATCCTOTOTTTAAOCCTTTTOAAAACCCAGTOATOATCTCAATGOOCAATOAAAATOTACTOOAAATTAAaOOA 8640 

841 l^aAapOlyllaLaaSarLyatyryhaAspXAvilaTyrValHIsAanProValvbatyaProPbaOlaXoraProVaiNatXlaSarKatoiyABnaiaAanValLaaOU 880 

8641 AAT0ATATT0ACCCT0AAOCAOTTAAA0QT0AA0TOTtAAAA0TTQGAAATAAOACC|nTbAaAATATACACTTACATTCT0A^ 8760 

i8aigy«p i 



881 AaaAapIlaAapProOlvAlaVal Ly aO 1 yG 1 «v« 1 LaaLy ava 10 1 yAaatya8ai|Cyap laAaaXUHlaLaoKtaSarGlaAUVal Lai yya^ brVa 1 Pr oAaaAa pLa«La«Ly8 880 

878 1 TTGAACAOCGAOCTAAATATAOAOTOOAAOCAAaC AATTTCTTCAACCOTCCTtOGAAAAOTAATAOTTCAACCAOATCAGAATTTCACAOOATXOATTOCTOOtOr^ 8880 

881 LanAanSarol«LaManXlaClnTrpLy8GlnAlaxiiSarSarTbrvaiLa«aiyx.y8ValIlaV«i0lBrroA8p0lBAaafb«^^ 860 

8681 ACAOCACTQTTATTACTACTTGOOTTTrTCCTOTOOCTOAAAAAOAOAAAOCAAATrAAAOAtCTQOOCAOTOAATTAOTTOCCTACOAT^M 30OO 

801 TbrAi*L*qLtqL»qL»Tit»Tioiyyhtybfli,«aTrpLattL yaX.yaArgtyaQlaIlaty8AapLaaoiy8atoiaL8ttValArtTyrAapAUAr<valHiaTbryreMlal^8AapAr^ 1000 

SOOl CTTGTAAaTOCCCo!u!oTGTAAGC^ 3180 

1001 t<aV*ISarAl»ArtfS<rV*IStTyroThrThrClaMttV*lSirA»nGlTiS*rV*lAapTyrArgAI>TbrFbayroGlAapQlnybayroA«B8ar8Br01nAii>01y8»i yyah rg 1040 

3181 CAAGTOCAOTATCCTCTGACAGACATOTCCCCCATCCTAACTAOTaOOGACTCTOATATATCCAGTCCATTACTOCAAAATACTOTCCACATTOACCTCAOTGCTCTAAATCCAOAOCTO 3840 

1041 OlBValOlBTyrProX.attTbrAapMatsarProXlaLaaTbr8avOXyA8pSarAaplla8arSarrroL««L*«oiaA8nTbrvaiHi8XiaAapLa«6arAULa«AaarrooimLatt 1080 

3841 0TCCMK>CA0TGCA0CATGTAGT0ATT0aOCCCAOTAOCCTOATT0TGCATTTCAAT0AA0TCATA0GAAaA00OCATTTT00lff91bTATATCATaQ<^ 3360 

1081 v*lQliU^Uv»lQlnHi»v»lvaUU01y>roStrSarLaalUV»IHt8rhaA8BQInV*lIlaGlyArgGlyHl8PhiQl| |cyay *lTyrHlaQlyThrLa»L»qAtpAaaAap01y llSO 

3361 AAOAAAATTCAdf59KTOTGi(^TCCTT0AACAGAATCACT0ACATA00AOAA0TTTCCCAATTTCTGACCCAG0GAATCATCATGA 3480 

1181 iyaLyaIlaHi^Cx^laV«iLy88artaaAanAr|XlaTbrAapXla01yOloValSarOlarbaLaaTbroi«GlyXlaZlaii8tX.y8A8pPbaSarHiarr^^ iieo 

3481 CTOOGAATdreCtTOCOAAOTOAAOOOTCTCCOCTGOTOOTCCTACCATACATGAAACATOOAGATCTTCOAAATTTCATTCOAAATOAGACTCATAATCCAACTO 3000 

1161 LauOlyX I^C£a^auArg8«rOltiO)y8«rrrot»nValvaiL««rroTyrHatLysKla01yAapL«aArgAsnrh«IltArgA8nGlalbrHl8AfD7roTbrvalLyBA8pt««Il« laoo 

601 aqCTTTOOTCTTCAAOTAOCCAAAOCGATGAAATATCTTOCAAOCAAAAAOTTTOTCCACAOAiGACTTOOCtOCAAOAAACffSIS^TOGt^^ 3780 

. 801 O lyPbaO lyLavO I «va l a l aLyaA 1 > MttLy«TyrL»ttAiy TLyaLyifb^v^l H l a Arg AapLavA 1 aA 1 aArfAa Hcy^ a t tauAa pO 1 vl^aVb aTb rva 1 Ly aVa I A 1 aAap 1 840 

3781 TTTOOtCTTOCCAOAOACATOTATOATAAAOAATACTATAOTGTACACAACAAAACAOOTOCAAAGCTOCCAOTGAAGTOOATOCCTTTOOAAAOTCTOCAAACtCAAA^ 3840 

1841 PbaOlyLauAlaArf A8pW«tTyrAapLyi01wTyrTyrSarValHiiA«nLyiThrOlyAl»I.ytLaorroV»iLy»TrpiiatAUX.aoOltt8«rI.«»OUThrGlnLy«tb»IbrThr 1880 

3841 AAOTCAaATOZOTOOTCCTTTOOCOTCOtCCTCTOaOAOCTOATOACAAOAOOAOCCCCACCTtATCCTOACOTAAACACCTTTOATATAACTOTTTACTtOrW 3860 

1881 LyaSarA8pVaiTrp8arrbaOlyValvaiia«Trpoi«x.aaHatTbrArgoiyAlarrePreTyrrroA8pValAaatbrPbaAapilatbrVaiTyrXianLaaoiaOlyAr|ArfLa« 1380 



3861 CTACAACCOOAATAi 
1381 taaOlaVroOlttTy 



dRJCfcCAGACCCCtTATATOAAOTAATOCTAAAAjfSCbGOCACCCTAAAOCCOAAATGCOCCCATCCTTTTCTO *060 

rfeytf roAap>roLaaTyi01av*lliattaatyi fey>p rpHtaProLyiAUQlaWttArgyroSir>haSart?ltttanv»lS>rArgriaSarAlallarba8 1360 

6081 ACYTTCATTOOOOAOCACTAtOTCGATOTCAAOOCTACTTATOtOAACOTAAAifiofbrcOCTCCOTArCCTTCtCTOTTOrCAZCAOAAOAT^ 4800 
1S61 Ibr»baXlaOlyOl»Hl8TyrValHlaValAaaAlaThrTyrvalAaftValLyifeap4lAlaFroTyryro8arla«t««8ar8arOIaAapAaBAlaAapAap01«valAaptbrAr| 1400 

6801 CCAOCCtCCTTCTOOOAOACATCAtAOTOCTAOfACtATOTCAAAOCAACAOTCCACACTTtOTCCAATaOTtTTTTCACTOCCTOACCTTTAAAAOOCCATCOAtATTCTTTOCteCTT 4380 
1401 ProAlaBarybaTrpOlaTbrSar 1408 

4381 OCCATAOOACTtOTATtOTTATTTAAATTACTOOATTCTAAOOAATTTCTTAtCIOACAOAOCATCAOAACCAOAOOCTTOOTCCCACAOGCCAOOOACCAAIOCOCW 4438 

Fig 2. MET prolooncogenc cDNA clones, nucleotide sequence, and deduced amino acid sequence. Nucleotides are numbered above and 
amino acids arc numbered below the line starting with the first methionine of the open reading frame. The putative si^al sequence and 
transmembrane domain are underiined with a solid bar. The consensus sequences for Nrlinkcd glycosylation are underlined, and the 
oligonucleotide primer is shown as 1. Cysteine residues are boxed. Putative ATP binding domain: lysine; Gly-Xaa-Gly-Xaa-Xaa-^iy. 
Sequence analysis was performed using the ANALSEQ and IDEAS program packages of R. Staden (Medical Research Council. U.K.) and M. 
Kanehisa (Kyoto University. Japan), respectively. Alignments of protein sequences were performed with the ALIGN program ol B. Urcutt, 
M . 0. Dayhoff, D. G. Geoige. and W. C. Barker (National Biomedical Research Foundation. Washington. DC). 
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Fig. 3. Comparison of the putative tyrosine kinase domain of the MET gene product (residues 1103-1366) with the corresponding domains 
of HIR, human EGFR (HEGFR), and the products of the y-abt and y-src genes. All residues shared between MET and two or more of the other 
kinases are boxed. The aligned sequences show Komolo^es of 44% with HIR, 38% with HEGFR, 41% with v*iifr/, and 37% with v-src. Hyphens 
are gaps inserted to maximize the number of matches in the comparison. 



the murine w-abl gene (41%) (8), and the human EGFR (38%) 
(12) (Fig. 3). There are two tyrosine residues (positions 1252 
and 1253) in |he MET kinase domain that could correspond 
to tyrosine-416, the major site of autophosphorylation in 
pp60^ *" (6). However, it is not yet known which tyrosine 
residue is phosphorylated in the MET oncogene or proto- 
oncogene product. A putative ATP-binding site at Lys-1127 
is 20 amino acids downstream froni a consensus Gly-Xaa- 
Gly-Xaa-Xaa-Gly (6) (residues 1102-1107). When compared 
with other kinases, the ATP-binding domain in MET is 
further fi'om the transmembr^e domain (130 amino acids), 
for example, only 49 amino acids separate the ATP-binding 
domain from the transmembrane domain in the EGFR/ 
transforming growth factor type a receptor (12) or HIR (7) 
(Fig. 4). Moreover, unlike other known tyrosine kinases, 
MET contains a cysteine in the ATP-binding domain (residue 
1126) instead of a valine residue. 

The breakpoint of the DNA rearrangement that activates 
the TPR-MET oncogene (3) is located at nucleotides 
3081/3082 in the METprptooncogene sequence (Fig. 2). This 
breakpoint is located 54 amino acids from the transmembrane 
domain; thus, the TPR-MET oncogene contains essentially 
only the kinase domain from the MET protooncogene and not 
the transmembrane domain. This correlates with protein data 
that shows that the TRP-MET oncogene protein product is 
not surface iodinated, is located in soluble cytoplasmic cell 
fractions, and has kinase activity (M. Gonzatti*Haces, per- 
sonal communication). The structure of TRP-MET is differ- 
ent from that of the v-erbB or w-ros (18), oncogenes that 
contain transmembrane domains and have lost only the 
extracellular ligand-binding domain, although we expect 
from other studies (M. Gonzatti-Haces, personal communi* 
cation) that TPR contributes coding sequences. 

DISCUSSION 

In this paper we predict the protein structural outlines of the 
MET protooncogene product based on nucleotide sequence 
of cDNA. We isolated overlapping cDNA clones that corre- 
spond to the 9.0-kb MET transcript expressed in HOS cells 
(3). The complete cDN A sequence has an open reading frame 
of 4224 nucleotides that could code for a protein of 157 kDa. 
This is consistent with the size of the MET protein products 
(110, 140, and 160 kDa) that have been detected with a 
C-terminal peptide antiserum (ref. 9 and M. Gonzatti-Haces, 
personal communication). The primary amino acid sequence 



shows structural features characteristic of the growth factor 
receptor tyrosine kinase gene family (Fig. 4). 

We had noted that sequence comparisons have revealed 
homologies between the primary structures of receptors. To 
determine if MET was at all homologous to known receptor or 
receptor-like proteins, we have searched the National Biomed- 
ical Research Foundation protein data bank^ and directly 
compared the MET protooncogene with the EGFR (12, 19), 
HIR (7), human insulin growth factor type 1 receptor (20), 
colony-stimulating factor type 1 receptor (21), platelet-derived 
growth factor receptor (22), low density lipoprotein receptor 
(23), transferrin receptor (24), and the epklennal growth fiactor 
precursor (25) using the ALIGN program of Dayhoff et al, (26). 
This direct comparison revealed no significant homology with 
any of these proteins outside of the kinase domain. A compar- 
ison of th^ structural features of the MET kinase domain with 
v-//m/colony-stimulating factor type 1 receptor (21), platelet- 
derived growth factor receptor (22), or v-kit (27), which have a 
split kinase domain (Fig. 4), showed that the MET kinase is not 
interrupted (Figs. 3 and 4). This suggests that MET should be 
included in the kinase subgroup that includes the EGFR/ 
transforming growth factor type a (12, 19), HIR (7), human 
insulin growth factor type 1 receptor (20), and y-ros (18) that 
have a single kinase domain. 

In comparison with other receptors, MET contains a large 
putative extracellular domain with a characteristic array of 
cysteine residues (Fig. 4). Cysteine-rich clusters are thought 
to form an essential structural backbone for the ligand- 
binding domain. In contrast to the platelet-derived growth 
factor receptor or colony-stimulating factor type 1 receptor, 
which have cysteine residues spaced throughoiit the extra- 
cellular domain, the EGFR/transforming growth factor type 
a receptor or HIR contain cysteine-rich domains (Fig. 4), 
Although the putative extracellular domain of MET contains 
one small cysteine-rich domain (11 out of 105 amino acids, 
residues 520-625; Figs. 2 and 4), the msgority of the cysteine 
residues (22 out of 34) are dispersed. The MET protein, 
therefore, appears to contain cysteine density features in 
common wiUi both classes of receptors. 

Although three MET-related polypeptides of 110, 140, and 
160 kDa have been observed in human cell lines only the 
140-kDa MET is labeled when cells are sur^u:e iodinated and is, 

^Protein Identification Resource (1986) Protein Sequence Database 
(Natl. Bionned. Res. Found., Washington. DC). Release 10. 
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Fig . 4. Schematic comparison of MET protooncogene with other 
cell-surface receptors. Cystcine-rich domains are shown in hatched 
boxes; other cysteine residues in the extracellular domain are 
represented as solid circles. Tyrosine kinase domains are cross- 
hatched boxes. CSF-1, colony-stimulating factor type 1; PDGF, 
platelet-derived growth factor; TGF-a, transforming growth factor 
type a. 

therefore, present on the cell surfiace (ref. 9 and M. Gonzatti- 
Haces, personal communication). The 140-kl)a MET product 
could correspond in size to the protein predicted from the MET 
open reading frame (157 kDa), suggesting that it is not exten- 
sively posttranslationally modified. The putative extracellular 
domain oif MET contains 11 potential sites for N-linked 
glycosylation (Fig. 2). but the 110-, 140-, or 160-kDa products 
do not appear to be heavily glycosylated (M, Gonzatti-Haces, 
personal communication). Analysis of the predicted MET 
protein secondary structure using the program of Stephens (28) 
with the algorithm of Hopp and Woods (29) showed that none 
of the potential N-linked glycosylation sites in MET occur in 
strongly predicted ^-tum structures. Since studies have shown 
that =67% of N-linked glycosylation sites of proteins occur at 
^-tum structures (30) few of the potential sites in MET may be 
available for glycosylation. 

These data are consistent with the proposal that the MET 
protooncogene represents a transmembrane protein that may 
act as a receptor for an as-yet-^unknown ligand. Given that 
phosphorylation of tyrosine and serine/threonine may play 
important physiological roles for several growth factor re- 
ceptors and protooncogenes, the fact that the MET proto- 
oncogene is expressed predominantly in human fibroblast 
and epithelial cell lines and rarely in cell lines of hematopoi- 
etic lineage (3) indicates that MET may be a cell-lineage- 
specific receptor for a growth or differentiation factor. 

The MET oncogene locus was shown to be closely linked 
to the inheritance of the recessive genetic disorder cystic 
fibrosis (31) thus localizing the cystic fibrosis locus to human 
chromosome 7q21-31 (5. 31). Using MET cDNA we have 
identified a new restriction fragment length polymorphism 
(RFLP) (32) «»100 kb from MET oncogene probes originally 
used for RFLP analysis and prenatal diagnosis. This RFLP 
will increase the confidence limits with which these analyses 
can be performed. 
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